Discrete-Valued Control by Sum-of-Absolute-Values Optimization

Takuya Ikeda, Masaaki Nagahara, Senior Member, IEEE, Shunsuke Ono, Member, IEEE 


Abstract —In this paper, we propose a new design method of 
discrete-valued control for continuous-time linear time-invariant 
systems based on sum-of-absolute-values (SOAV) optimization.
We first formulate the discrete-valued control design as a finite-horizon
SOAV optimal control, which is an extended version
of $L^1$ optimal control. We then give simple conditions that
guarantee the existence, discreteness, and uniqueness of the SOAV 
optimal control. Also, we give the continuity property of the value 
function, by which we prove the stability of infinite-horizon model
predictive SOAV control systems. We provide a fast algorithm for 
the SOAV optimization based on the alternating direction method 
of multipliers (ADMM), which has an important advantage in 
real-time control computation. A simulation result shows the 
effectiveness of the proposed method. 

Index Terms —Discrete-valued control, optimal control, convex 
optimization, model predictive control. 

I. Introduction 

Discrete-valued control is a control mechanism that achieves
control objectives (e.g. stability) with control inputs taking
values in a finite alphabet (e.g. bang-bang control: 1-bit control
taking ±1). Discrete-valued control has a significant advantage
in networked control, in which control signals are quantized
and transmitted through networks (see e.g. [1]): since
discrete-valued control signals need not be quantized, no quantization
error occurs. Discrete-valued control also has important
applications in DC-DC conversion [2], class-D amplifiers,
hybrid power systems, train control, and hormone therapy,
to name a few.

A standard design method for discrete-valued control is
mixed-integer programming. Although this directly gives
discrete-valued control, it requires heavy computation,
and hence it can be used only for relatively slow
plants such as a gas supply system. A more
tractable method is dynamic quantization.
In this approach, a dynamic quantizer is designed such that the
quantizer mimics the ideal (i.e. unquantized) continuous
output, and the state space representation of the dynamic
quantizer is given in a closed form. This method, however,
assumes an infinite alphabet (e.g. the set of integers, $\mathbb{Z}$).
Another approach is the control parametrization enhancing
transform, in which the optimal switching
times of a piecewise-constant (i.e. discrete-valued) control
input are computed. This approach assumes that the number
of switchings is known in advance, which is hard to
obtain in practice.

Alternatively, we propose a novel method for discrete-valued
control based on the idea of sum-of-absolute-values
(SOAV) optimization. The proposed optimal control,
which we call the SOAV optimal control, is an extended version
of $L^1$ optimal control (also known as minimum fuel
control [13]). The SOAV optimization is convex, and hence the
solution can be obtained efficiently. In fact, as shown in
Section V-B, the optimization is solved, after time-discretization,
by the alternating direction method of multipliers (ADMM),
which is a simple algorithm that is much faster for
large scale problems than the standard interior point method.

For theoretical analysis, we prove the existence, discreteness,
and uniqueness of the (finite-horizon) SOAV optimal
control under simple conditions (e.g. the plant model is
controllable, the A-matrix is nonsingular, and the finite alphabet
for the control includes 0). The obtained discrete-valued
control is a piecewise constant signal, and we prove that the number of
discontinuities, or switching times, is bounded. This property
is particularly important for networked control, since the
upper bound on the number of switching times, which can be
given before optimization, yields an upper bound on the data
rate required to transmit the discrete-valued control.

We also prove that the value function, which is defined 
as the optimal value of the cost function of the optimal 
control problem, is a continuous and convex function of initial 
states. This property is applied to prove the stability of the 
model predictive control (MPC) feedback system based on
the finite-horizon SOAV optimal control. As mentioned above, 
the SOAV optimal control can be obtained by the fast ADMM 
algorithm, and hence the control is well-adapted for MPC.

The remainder of this paper is organized as follows. In
Section II, we give mathematical preliminaries for our subsequent
discussion. In Section III, we formulate the optimal control
problem so that optimal controls take the desired discrete
values. We then examine the optimal controls and derive the
existence, discreteness, and uniqueness of the SOAV optimal
control. A numerical optimization algorithm based on ADMM
is also presented in this section. Section IV investigates the
continuity and the convexity of the value function in SOAV
optimal control. Section V gives the model predictive control
formulation and shows the stability. Section VI presents an
example of model predictive control to illustrate the effectiveness
of the proposed method. In Section VII, we offer concluding
remarks.
II. Mathematical preliminaries

This section reviews basic definitions, facts, and notation 
that will be used throughout the paper. 

Let $n$ be a positive integer. For a vector $x \in \mathbb{R}^n$ and a scalar
$\varepsilon > 0$, the $\varepsilon$-neighborhood of $x$ is defined by
$B(x,\varepsilon) \triangleq \{y \in \mathbb{R}^n : \|y - x\| < \varepsilon\}$,
where $\|\cdot\|$ denotes the Euclidean norm
in $\mathbb{R}^n$. Let $X$ be a subset of $\mathbb{R}^n$. A point $x \in X$ is called an
interior point of $X$ if there exists $\varepsilon > 0$ such that $B(x,\varepsilon) \subset X$.
The interior of $X$ is the set of all interior points of $X$, and we
denote the interior of $X$ by $\mathrm{int}\,X$. A set $X$ is said to be open
if $X = \mathrm{int}\,X$. A point $x \in \mathbb{R}^n$ is called an adherent point of
$X$ if $B(x,\varepsilon) \cap X \neq \emptyset$ for every $\varepsilon > 0$, and the closure of
$X$, denoted by $\overline{X}$, is the set of all adherent points of $X$. A
set $X \subset \mathbb{R}^n$ is said to be closed if $X = \overline{X}$. The boundary of
$X$, denoted by $\partial X$, is the set of all points in the closure of
$X$ not belonging to the interior of $X$, i.e., $\partial X = \overline{X} - \mathrm{int}\,X$,
where $X_1 - X_2$ is the set of all points that belong to the set
$X_1$ but not to the set $X_2$. In particular, if $X$ is closed, then
$\partial X = X - \mathrm{int}\,X$, since $X = \overline{X}$. A set $X \subset \mathbb{R}^n$ is said to be
convex if, for any $x, y \in X$ and any $\lambda \in [0,1]$, $(1-\lambda)x + \lambda y$
belongs to $X$.

A real-valued function $f$ defined on $\mathbb{R}^n$ is said to be lower
semi-continuous on $\mathbb{R}^n$ if for every $\alpha \in \mathbb{R}$ the set
$\{x \in \mathbb{R}^n : f(x) > \alpha\}$ is open. It is known that if a function $f$ is lower
semi-continuous on $\mathbb{R}^n$, then

$$f(x) \le \liminf_{y \to x} f(y)$$

for every $x \in \mathbb{R}^n$. A real-valued function $f$ defined
on a convex set $C \subset \mathbb{R}^n$ is said to be convex if

$$f((1-\lambda)x + \lambda y) \le (1-\lambda) f(x) + \lambda f(y)$$

for all $x, y \in C$ and all $\lambda \in (0,1)$.

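Let $T > 0$. For a continuous-time signal $u(t)$ over a time
interval $[0,T]$, we define its $L^1$ and $L^\infty$ norms respectively
by

$$\|u\|_1 \triangleq \int_0^T |u(t)|\,dt, \quad \text{and} \quad \|u\|_\infty \triangleq \operatorname*{ess\,sup}_{t \in [0,T]} |u(t)|.$$

We denote by $m$ the Lebesgue measure on $\mathbb{R}$.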

III. Discrete-valued Control Problem 

In this paper, we consider a linear time-invariant system
represented by

$$\dot{x}(t) = A x(t) + B u(t), \quad t \ge 0, \tag{1}$$

where $x(t) \in \mathbb{R}^n$, $u(t) \in \mathbb{R}$, $A \in \mathbb{R}^{n \times n}$, and $B \in \mathbb{R}^{n \times 1}$. We
here assume single-input control for simplicity. For the system
(1), we assume discrete-valued control, that is, the control $u(t)$
can only take values in a fixed finite set (or finite alphabet)

$$\mathbb{U} \triangleq \{\pm U_1, \pm U_2, \ldots, \pm U_N\}, \tag{2}$$

where $U_1, \ldots, U_N$ are non-negative real numbers satisfying

$$0 \le U_{\min} \triangleq U_1 < U_2 < \cdots < U_N = 1. \tag{3}$$

Here we assume the maximum value $U_N = 1$ without loss
of generality (otherwise, use $B/U_N$ instead of $B$ in (1)). Let
an initial state $\xi \in \mathbb{R}^n$ and a finite time $T > 0$ be given.
The control objective is to obtain a discrete-valued control
$u(t) \in \mathbb{U}$ for $t \in [0,T]$ that steers the state $x(t)$ from the
initial state $\xi$ to the origin at time $T$. We will show in this
paper that such a discrete-valued control can be efficiently
obtained by sum-of-absolute-values (SOAV) optimal control
described below.

SOAV optimal control is an extended version of $L^1$ optimal
control (also known as minimum-fuel control [13]). Let us denote
by $\mathcal{U}(\xi)$ the set of all feasible controls that satisfy $x(0) = \xi$,
$x(T) = 0$, and $\|u\|_\infty \le 1$ for the system (1). We assume
that $\mathcal{U}(\xi)$ is non-empty. This assumption is satisfied if $T$ is
greater than the minimum time $T^*$ of the time optimal control.
Then the $L^1$ optimal control is a control that minimizes
the cost function $\|u\|_1$ among all feasible $u \in \mathcal{U}(\xi)$. It is
known that the $L^1$ optimal control takes only the values 0 and ±1 when
the system (1) is normal, that is, the coefficient matrix $A$ is
non-singular and the pair $(A, B)$ is controllable [13, Theorem
6-13]. In other words, if (1) is normal, then the $L^1$ optimal
control gives a discrete-valued control on $\mathbb{U}$ with $U_1 = 0$ and
$U_2 = 1$ ($N = 2$). To extend this idea to a general set $\mathbb{U}$ as in
(2), we consider the following SOAV cost function:

$$J(u) \triangleq \sum_{i=1}^{N} w_i \phi_i(u), \quad \phi_i(u) \triangleq \|u - U_i\|_1 + \|u + U_i\|_1, \tag{4}$$

where $w_1, \ldots, w_N$ are weights satisfying $w_1 + w_2 + \cdots + w_N = 1$.
The motivation for this cost function is based on
the observation that if $u(t) = U_i$ on a set $I \subset [0,T]$, then
$u(t) - U_i = 0$ on $I$, which is sparse and reduces the norm
$\|u - U_i\|_1$, as discussed in [12].
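To make the cost (4) concrete, the following Python sketch evaluates a Riemann-sum approximation of $J(u)$ for a sampled signal. It is only an illustration: the function name `soav_cost`, the uniform step `h`, and the sampling convention `u[l] = u(l*h)` are assumptions introduced here and are not part of the paper.

```python
import numpy as np


def soav_cost(u, h, levels, weights):
    """Approximate J(u) in (4) for samples u[l] = u(l*h) on a uniform grid.

    levels  : U_1 < ... < U_N (non-negative, U_N = 1)
    weights : w_1, ..., w_N (summing to 1)
    """
    u = np.asarray(u, dtype=float)
    cost = 0.0
    for U_i, w_i in zip(levels, weights):
        # phi_i(u) = ||u - U_i||_1 + ||u + U_i||_1, with integrals replaced by sums
        phi_i = h * (np.abs(u - U_i).sum() + np.abs(u + U_i).sum())
        cost += w_i * phi_i
    return cost
```

For instance, with `levels=[0.0, 1.0]`, `weights=[0.5, 0.5]`, and ten samples of step `h=0.1`, the zero signal gives a cost of 1.0 and the constant signal 1 gives 2.0.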

Let us formulate the associated optimal control problem as 
follows. 

Problem 1 (SOAV optimal control problem): For a given
initial state $\xi \in \mathbb{R}^n$, find a feasible control $u \in \mathcal{U}(\xi)$ that
minimizes the SOAV cost function $J(u)$ given in (4).

We will show that under some assumptions on the system
(1) and the initial state $\xi$, the SOAV optimal control takes its
values in the set $\mathbb{U}$.

A. Existence 

Here we show the existence theorem for the SOAV optimal 
control. 

Let us define the reachable set of initial values from which
the state $x(t)$ in (1) is steered to the origin by some control
$u(t)$, $t \in [0,T]$, with $\|u\|_\infty \le 1$.

Definition 1 (reachable set): For the system (1), the reachable
set $\mathcal{R}$ at time $T$ is defined by

$$\mathcal{R} \triangleq \left\{ \int_0^T e^{-At} B u(t)\,dt : \|u\|_\infty \le 1 \right\} \subset \mathbb{R}^n.$$

Then we have the following existence theorem.

Theorem 1 (existence): For each initial state in the reachable
set $\mathcal{R}$, there exists an SOAV optimal control.

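Proof: Let an initial state $\xi \in \mathcal{R}$ be fixed. The feasible
control set $\mathcal{U}(\xi)$ can be described by

$$\mathcal{U}(\xi) = \left\{ u : \int_0^T e^{-At} B u(t)\,dt = -\xi,\ \|u\|_\infty \le 1 \right\}. \tag{5}$$

Since the set $\mathcal{U}(\xi)$ is non-empty, we can define

$$\theta \triangleq \inf\{J(u) : u \in \mathcal{U}(\xi)\}.$$

Then there exists a sequence $\{u_l\}_{l \in \mathbb{N}} \subset \mathcal{U}(\xi)$ such that
$\lim_{l \to \infty} J(u_l) = \theta$, $\|u_l\|_\infty \le 1$, and

$$\xi = -\int_0^T e^{-At} B u_l(t)\,dt. \tag{6}$$

Since the set $\{u \in L^\infty : \|u\|_\infty \le 1\}$ is sequentially compact
in the weak* topology of $L^\infty$ [20, Theorem A.9], there exist
a measurable function $u_\infty$ with $\|u_\infty\|_\infty \le 1$ and a subsequence
$\{u_{l'}\}$ such that $\{u_{l'}\}$ converges to $u_\infty$ in the weak* topology
of $L^\infty$, that is, we have

$$\lim_{l' \to \infty} \int_0^T \bigl(u_{l'}(t) - u_\infty(t)\bigr) f(t)\,dt = 0 \tag{7}$$

for any $f \in L^1$. Since (6) holds for $l = l'$, letting $l' \to \infty$ and using (7) we have

$$\xi = -\int_0^T e^{-At} B u_\infty(t)\,dt,$$

and hence $u_\infty \in \mathcal{U}(\xi)$. Put

$$J_{l'}^{\pm} \triangleq \sum_{i=1}^{N} w_i \int_0^T \bigl(u_{l'}(t) \pm U_i\bigr)\,\mathrm{sgn}\bigl(u_\infty(t) \pm U_i\bigr)\,dt, \tag{8}$$

where the function sgn is defined by

$$\mathrm{sgn}(v) \triangleq \begin{cases} v/|v|, & \text{if } v \neq 0, \\ 0, & \text{if } v = 0. \end{cases}$$

From (7), we have

$$\lim_{l' \to \infty} J_{l'}^{\pm} = \sum_{i=1}^{N} w_i \|u_\infty \pm U_i\|_1.$$

Let $J_{l'} \triangleq J_{l'}^{+} + J_{l'}^{-}$. Then the above equation gives

$$\lim_{l' \to \infty} J_{l'} = \lim_{l' \to \infty} \bigl(J_{l'}^{+} + J_{l'}^{-}\bigr) = J(u_\infty). \tag{9}$$

Also, from (4) and (8), we have

$$J_{l'} \le |J_{l'}| \le \sum_{i=1}^{N} w_i \bigl(\|u_{l'} - U_i\|_1 + \|u_{l'} + U_i\|_1\bigr) = J(u_{l'}) \tag{10}$$

for each $l' \in \mathbb{N}$. Since the sequence $\{J(u_l)\}$ converges to $\theta$
as $l \to \infty$, the subsequence $\{J(u_{l'})\}$ has the same limit $\theta$.
Therefore we have

$$J(u_\infty) = \lim_{l' \to \infty} J_{l'} \le \lim_{l' \to \infty} J(u_{l'}) = \theta \tag{11}$$

from (9) and (10). On the other hand, since $u_\infty \in \mathcal{U}(\xi)$, we
have $J(u_\infty) \ge \theta$. Together with (11), this gives $J(u_\infty) = \theta$, and
$u_\infty$ is an optimal control for the initial state $\xi$. ■

B. Discreteness of SOAV optimal control

Here we show that the SOAV optimal solution is a discrete-valued
control on $\mathbb{U}$. The following theorem is one of the
main results.

Theorem 2 (discreteness): Assume that the coefficient matrix
$A$ is non-singular and the pair $(A, B)$ is controllable. If an
SOAV optimal control $u^*$ exists, then either of the following
holds.

(i) $u^*(t) \in \mathbb{U}$ for almost all $t \in [0,T]$.

(ii) $\|u^*\|_\infty \le U_{\min}$.

In particular, if $U_{\min} = 0$, then $u^*(t)$ takes a value in $\mathbb{U}$ for
almost all $t \in [0,T]$.

Proof: The Hamiltonian $H$ for the SOAV optimal control
problem is defined by

$$H(x, u, p) \triangleq L(u) + p^\top (Ax + Bu),$$

where

$$L(u) \triangleq \sum_{i=1}^{N} w_i \bigl(|u - U_i| + |u + U_i|\bigr)$$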

and $p$ is the costate vector. Let $x^*$ denote the trajectory
corresponding to $u^*$. From Pontryagin's minimum principle,
there exists a costate vector $p^*$ satisfying $H(x^*, u^*, p^*) \le
H(x^*, u, p^*)$, or

$$L(u^*) + (p^*)^\top B u^* \le L(u) + (p^*)^\top B u \tag{12}$$

for every $u$ with $|u| \le 1$. Therefore, the optimal control is
the minimizer of the right hand side of (12), which can be
obtained analytically as follows.

An elementary computation yields

$$L(u) = \begin{cases} -a_k u + b_k, & \text{if } u \in [-U_{k+1}, -U_k], \\ 2\sum_{i=1}^{N} w_i U_i, & \text{if } u \in [-U_1, U_1], \\ a_k u + b_k, & \text{if } u \in [U_k, U_{k+1}], \end{cases} \tag{13}$$

for $k = 1, 2, \ldots, N-1$, where

$$a_k \triangleq 2\sum_{i=1}^{k} w_i, \quad b_k \triangleq 2\sum_{i=k+1}^{N} w_i U_i, \quad k = 1, 2, \ldots, N-1. \tag{14}$$

Put $q(t) \triangleq p^*(t)^\top B \in \mathbb{R}$, $f(u) \triangleq L(u) + qu$, and

$$c_k \triangleq \begin{cases} -a_{N-k} + q, & k = 1, \ldots, N-1, \\ q, & k = N, \\ a_{k-N} + q, & k = N+1, \ldots, 2N-1. \end{cases}$$


From (13) and (14), it is easily shown that $f(u)$ is continuous
and piecewise linear on $[-1, 1]$, with slope $c_k$ on the $k$-th of the
$2N-1$ consecutive intervals, and that $c_1 < c_2 < \cdots < c_{2N-1}$.
Then we have the following.

1) If $c_1 > 0$, then

$$\arg\min_{|u| \le 1} f(u) = -U_N = -1.$$

2) If $c_k < 0$ and $c_{k+1} > 0$ for $k \in \{1, 2, \ldots, 2N-2\}$,
then we have

$$c_1 < \cdots < c_k < 0 < c_{k+1} < \cdots < c_{2N-1}.$$

This implies that

$$\arg\min_{|u| \le 1} f(u) = \begin{cases} -U_{N-k}, & k = 1, \ldots, N-1, \\ U_{\min}, & k = N, \\ U_{k-N+1}, & k = N+1, \ldots, 2N-2. \end{cases}$$

3) If $c_{2N-1} < 0$, then we have

$$c_1 < c_2 < \cdots < c_{2N-1} < 0$$

and hence

$$\arg\min_{|u| \le 1} f(u) = U_N = 1.$$

4) If $c_k = 0$ for $k \in \{1, 2, \ldots, 2N-1\}$, then we have

$$\arg\min_{|u| \le 1} f(u) \in \begin{cases} [-U_{N-k+1}, -U_{N-k}], & k = 1, \ldots, N-1, \\ [-U_{\min}, U_{\min}], & k = N, \\ [U_{k-N}, U_{k-N+1}], & k = N+1, \ldots, 2N-1. \end{cases}$$

In this case, the minimizer of $f(u)$ is not determined
uniquely.

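In summary, the minimizer of $f(u)$, that is, the SOAV optimal
control $u^*$, is given by

$$u^*(t) = \begin{cases} -1, & \text{if } a_{N-1} < q(t), \\ -U_{N-k}, & \text{if } a_{N-k-1} < q(t) < a_{N-k}, \\ -U_{\min}, & \text{if } 0 < q(t) < a_1, \\ U_{\min}, & \text{if } -a_1 < q(t) < 0, \\ U_{k+1}, & \text{if } -a_{k+1} < q(t) < -a_k, \\ 1, & \text{if } q(t) < -a_{N-1}, \end{cases}$$

where $k = 1, 2, \ldots, N-2$, and

$$u^*(t) \in \begin{cases} [-U_{N-k+1}, -U_{N-k}], & \text{if } q(t) = a_{N-k}, \\ [-U_{\min}, U_{\min}], & \text{if } q(t) = 0, \\ [U_k, U_{k+1}], & \text{if } q(t) = -a_k, \end{cases} \tag{15}$$

where $k = 1, 2, \ldots, N-1$.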

Next we claim that

$$m(\{t \in [0,T] : q(t) = \pm a_k\}) = 0 \tag{16}$$

for every $k \in \{1, 2, \ldots, N-1\}$, where $m$ denotes the
Lebesgue measure. Let $k \in \{1, 2, \ldots, N-1\}$ be fixed, and
assume $m(\{t \in [0,T] : q(t) = a_k\}) > 0$. Then we have

$$q(t) = a_k \tag{17}$$

on a set $E \subset [0,T]$ with $m(E) > 0$. From Pontryagin's
minimum principle, we have

$$\dot{p}^*(t) = -A^\top p^*(t) \tag{18}$$

for $t \in [0,T]$, and hence we have $p^*(t)^\top A B = 0$ for $t \in E$
by differentiating (17). Again, by differentiating this equation,
we also have $p^*(t)^\top A^2 B = 0$ for $t \in E$ from (18). Repeating
this yields $p^*(t)^\top A^l B = 0$ on $E$ for every $l \in \mathbb{N}$. Therefore
we have

$$p^*(t)^\top A \,[B \ \ AB \ \ \cdots \ \ A^{n-1}B] = 0 \tag{19}$$

for $t \in E$. Since $a_k \neq 0$ for every $k \in \{1, \ldots, N-1\}$, it
follows from (17) that $p^*(t)$ is not identically 0 on $[0,T]$, and
hence the determinant of $A[B \ AB \ \cdots \ A^{n-1}B]$ is 0. However,
this contradicts the assumption that the matrix $A$ is non-singular
and the pair $(A, B)$ is controllable. Therefore
$m(\{t \in [0,T] : q(t) = a_k\}) = 0$ holds for every $k \in \{1, \ldots, N-1\}$.
Similarly, we can also prove that $m(\{t \in [0,T] : q(t) =
-a_k\}) = 0$ for every $k \in \{1, 2, \ldots, N-1\}$, and hence (16)
holds for every $k \in \{1, 2, \ldots, N-1\}$.

Next, assume $m(\{t \in [0,T] : q(t) = 0\}) > 0$. Then
we have $p^*(t)^\top B = 0$ on a set $F \subset [0,T]$ with $m(F) > 0$. From (18), by a
similar computation as above, we have the relation (19) for
$t \in F$. Since the matrix $A[B \ AB \ \cdots \ A^{n-1}B]$ is non-singular
from the assumption, it follows that

$$p^*(t) = 0, \quad \forall t \in F. \tag{20}$$

Since we have $p^*(t) = e^{-A^\top t} p_0$ on $[0,T]$ for some $p_0 \in
\mathbb{R}^n$ from (18), it follows from (20) that $p_0 = 0$, and hence
$p^*(t) = 0$ on $[0,T]$. Then $q(t) = p^*(t)^\top B = 0$ on $[0,T]$, and
we have $\|u^*\|_\infty \le U_1 = U_{\min}$ from (15). Therefore, if $q(t)$ is
0 on a set with positive measure, then the optimal control $u^*$
satisfies $\|u^*\|_\infty \le U_{\min}$, and otherwise, the optimal control
takes discrete values $\pm U_1, \ldots, \pm U_N$ for almost all $t \in [0,T]$.

■
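The closed-form minimizer derived in the proof can be evaluated pointwise once $q(t) = p^*(t)^\top B$ is known. The Python sketch below is an illustration under stated assumptions, not the authors' implementation: the function name `soav_pointwise_minimizer` is hypothetical, the weights are assumed strictly positive so that the thresholds $a_k = 2\sum_{i \le k} w_i$ are strictly increasing, and on the measure-zero ties $q = \pm a_k$ or $q = 0$ (where the minimizer is an interval) it simply returns one admissible value.

```python
import numpy as np


def soav_pointwise_minimizer(q, levels, weights):
    """Return a minimizer of f(u) = L(u) + q*u over |u| <= 1.

    levels  : U_1 < ... < U_N with U_N = 1
    weights : w_1, ..., w_N (assumed strictly positive)
    """
    levels = np.asarray(levels, dtype=float)
    weights = np.asarray(weights, dtype=float)
    # Thresholds a_1 < ... < a_{N-1}, with a_k = 2 * (w_1 + ... + w_k).
    a = 2.0 * np.cumsum(weights)[:-1]
    # j = number of thresholds strictly below |q|; then |u*| = U_{j+1}.
    j = int(np.sum(a < abs(q)))
    # The sign of u* is opposite to the sign of q.
    return float(-np.sign(q) * levels[j])
```

With `levels=[0.0, 0.2, 0.6, 1.0]` and `weights=[0.1, 0.2, 0.3, 0.4]`, the thresholds are approximately 0.2, 0.6, and 1.2, so `q=-0.4` falls between $-a_2$ and $-a_1$ and the function returns $U_2 = 0.2$.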

Theorem 2 suggests that if $U_{\min} = 0$ then the SOAV optimal
control is a discrete-valued control that takes values in $\mathbb{U}$.
Otherwise, it is useful to derive a condition for the optimal
control $u^*$ to satisfy the statement (i) in Theorem 2. In fact,
it will be shown that there exists a subset of $\mathbb{R}^n$ such that if
an initial state $\xi$ is in this set then the SOAV optimal control
takes values in $\mathbb{U}$. To derive such a subset, we prepare the
following lemmas.

Lemma 1: The cost function $J$ has the minimum value

$$J_{\min} \triangleq 2T \sum_{i=1}^{N} w_i U_i$$

on the set $\{u : \|u\|_\infty \le 1\}$.

Proof: See Appendix A. ■

Lemma 2: Let $\mathcal{R}_{\min}$ be the set of all initial values at which
the optimal value is equal to $J_{\min}$. Then we have

$$\mathcal{R}_{\min} = \left\{ \int_0^T e^{-At} B u(t)\,dt : \|u\|_\infty \le U_{\min} \right\}.$$

In particular, if $U_{\min} = 0$, then we have $\mathcal{R}_{\min} = \{0\}$.

Proof: See Appendix B. ■

Now let us state the following theorem on the discreteness
of the SOAV optimal control.

Theorem 3 (discreteness for nonzero $U_{\min}$): Assume that
$A$ is non-singular and the pair $(A, B)$ is controllable. If $\xi \in
\mathcal{R} - \mathcal{R}_{\min}$, then the optimal controls take values in $\mathbb{U}$ almost
everywhere in $[0,T]$. Otherwise, if $\xi \in \mathcal{R}_{\min}$, then the optimal
controls take values less than or equal to $U_{\min}$ on $[0,T]$.

Proof: This follows immediately from Theorem 1, Theorem 2, Lemma
1, and Lemma 2. ■

Next, we show the uniqueness theorem of the SOAV optimal
control.

Theorem 4 (uniqueness): Assume that $A$ is non-singular and
the pair $(A, B)$ is controllable. Then the SOAV optimal control
for the initial state $\xi \in \mathcal{R} - \mathcal{R}_{\min}$ is unique. In particular, if
$U_{\min} = 0$, the SOAV optimal control is unique for any initial
state $\xi \in \mathcal{R}$.

Proof: Fix an initial state $\xi \in \mathcal{R} - \mathcal{R}_{\min}$, and let $u_1$ and
$u_2$ be optimal controls for $\xi$. Then we have

$$J(u_1) = J(u_2) \le J(u) \tag{21}$$

for all $u \in \mathcal{U}(\xi)$. For any $\lambda \in (0,1)$, the control $\lambda u_1 + (1-\lambda) u_2$
is feasible for $\xi$, and hence the convexity of the cost
function $J$ and (21) yield

$$J(u_1) \le J(\lambda u_1 + (1-\lambda) u_2) \le \lambda J(u_1) + (1-\lambda) J(u_2) = J(u_1),$$

which means $J(\lambda u_1 + (1-\lambda) u_2) = J(u_1)$. Therefore $\lambda u_1 +
(1-\lambda) u_2$ is an optimal control for $\xi$.
Put

$$\begin{aligned} E_1 &\triangleq \{t \in [0,T] : u_1(t) \in \mathbb{U}\}, \\ E_2 &\triangleq \{t \in [0,T] : u_2(t) \in \mathbb{U}\}, \\ F &\triangleq \{t \in E_1 \cap E_2 : u_1(t) = u_2(t)\}, \\ G &\triangleq \{t \in E_1 \cap E_2 : u_1(t) \neq u_2(t)\}. \end{aligned}$$

From Theorem 3, we have $m(E_1) = m(E_2) = T$, and then
we also have $m(E_1 \cap E_2) = T$.

Here, there exists some $\lambda \in (0,1)$ such that $\lambda u_1(t) + (1-\lambda) u_2(t) \notin \mathbb{U}$
for any $t \in G$, from the gap between the
uncountability of $(0,1)$ and the countability of $\mathbb{U}$. It follows
from the optimality of the control $\lambda u_1 + (1-\lambda) u_2$ for such
$\lambda$ and Theorem 3 that $m(G) = 0$. Therefore we have

$$m(F) = m(F) + m(G) = m(E_1 \cap E_2) = T,$$

and then

$$T = m(F) \le m(\{t \in [0,T] : u_1(t) = u_2(t)\}) \le T,$$

which yields

$$m(\{t \in [0,T] : u_1(t) = u_2(t)\}) = T.$$

This means the uniqueness. ■

From the above discussion, the SOAV optimal control can
give a discrete-valued control taking values in $\mathbb{U}$ under some
assumptions on the system (1) and the initial state $\xi$. A
discrete-valued control is a piecewise constant signal, and
changes its value at switching instants. An infinite number of
switchings would be undesirable in real applications; however,
this never happens. In fact, we have the following theorem that
gives an upper bound on the number of switchings.

Theorem 5 (number of switching): Assume that $A$ is non-singular
and the pair $(A, B)$ is controllable. Then the number
$M$ of switchings of the SOAV optimal control for each initial
state $\xi \in \mathcal{R} - \mathcal{R}_{\min}$ satisfies

$$M < n(2N-1)(1 + \Omega T/\pi), \tag{22}$$

where $\Omega$ is the largest imaginary part of the eigenvalues of $A$.
In particular, if $U_{\min} = 0$, then the number $M$ of switchings
for each initial state $\xi \in \mathcal{R}$ satisfies

$$M < 2n(N-1)(1 + \Omega T/\pi). \tag{23}$$

Proof: Fix arbitrarily an initial state $\xi \in \mathcal{R} - \mathcal{R}_{\min}$, and let
$u^*$ be the SOAV optimal control for the initial state $\xi$. From
the proof of Theorem 2, $u^*(t)$ has discontinuities at points
such that $q(t) = p^*(t)^\top B \in S \triangleq \{0, \pm a_1, \ldots, \pm a_{N-1}\}$,
where $a_k$ is defined in (14). Take arbitrarily an element $a \in S$.
Since $p^*(t) = e^{-A^\top t} p_0$ for some $p_0 \in \mathbb{R}^n$, we have $q(t) =
p_0^\top e^{-At} B$. Therefore $q(t) = a$ implies $a - p_0^\top e^{-At} B = 0$, or

$$\begin{bmatrix} a & -p_0^\top \end{bmatrix} \exp\left( \begin{bmatrix} 0 & 0 \\ 0 & -A \end{bmatrix} t \right) \begin{bmatrix} 1 \\ B \end{bmatrix} = 0.$$

The number of zeros on $[0,T]$ of the function on the left hand
side is less than $n(1 + \Omega T/\pi)$ according to [21]. Counting
the elements of the set $S = \{0, \pm a_1, \ldots, \pm a_{N-1}\}$ yields the
estimate (22). In particular, if $U_{\min} = 0$, the switching instants
of the SOAV optimal control for $\xi \in \mathcal{R}$ consist of all $t$ such
that $q(t) \in \{\pm a_1, \ldots, \pm a_{N-1}\}$, and the estimate (23) holds.
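Because the bounds (22) and (23) depend only on $A$, $T$, and $N$, they can be evaluated before any optimization is run. The following Python sketch computes the right-hand sides; the function name `switching_bound` and its boolean flag are hypothetical names introduced for illustration.

```python
import numpy as np


def switching_bound(A, T, N, umin_is_zero):
    """Right-hand side of (22), or of (23) when U_min = 0."""
    A = np.atleast_2d(np.asarray(A, dtype=float))
    n = A.shape[0]
    # Omega: largest imaginary part among the eigenvalues of A.
    omega = np.max(np.linalg.eigvals(A).imag)
    zeros_per_level = n * (1.0 + omega * T / np.pi)
    # |S| = 2N - 1 levels in general; 2(N - 1) when U_min = 0.
    num_levels = 2 * (N - 1) if umin_is_zero else 2 * N - 1
    return num_levels * zeros_per_level
```

As a quick check, a harmonic oscillator `A = [[0, 1], [-1, 0]]` has $\Omega = 1$; with $T = \pi$ and $N = 2$, bound (23) evaluates to $2 \cdot 2 \cdot 1 \cdot (1 + 1) = 8$.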


IV. Value function 

In this section, we investigate the value function in the
SOAV optimal control. The value function is the optimal value
of the SOAV optimal control, defined by

$$V(\xi) \triangleq \min\{J(u) : u \in \mathcal{U}(\xi)\}, \quad \xi \in \mathcal{R},$$

where $J(u)$ is defined in (4). From the existence theorem
(Theorem 1), this is well-defined. In this section, we will
show the continuity of the value function $V(\xi)$. This property
plays an important role in proving the stability when the optimal
control is extended to model predictive control (see Section
V below). To prove the continuity, the following lemmas are
fundamental.

Lemma 3: The value function $V(\xi)$ is convex on $\mathcal{R}$.

Proof: See Appendix C. ■

Lemma 4: For $\alpha \ge J_{\min}$, let

$$\mathcal{R}_\alpha \triangleq \left\{ \int_0^T e^{-At} B u(t)\,dt : \|u\|_\infty \le 1,\ J(u) \le \alpha \right\}.$$

Then the set $\mathcal{R}_\alpha$ is closed for every $\alpha \ge J_{\min}$.

Proof: See Appendix D. ■

Lemma 5: If the pair $(A, B)$ is controllable, then we have

$$\mathrm{int}\,\mathcal{R} = \{\xi \in \mathcal{R} : V(\xi) < 2T\}.$$

In particular,

$$\partial\mathcal{R} = \{\xi \in \mathcal{R} : V(\xi) = 2T\}.$$

Proof: See Appendix E. ■

From these lemmas, we show the continuity of the value
function $V(\xi)$ based on the discussion given in [22].

Theorem 6 (continuity of $V$): If the pair $(A, B)$ is controllable,
then $V(\xi)$ is continuous on $\mathcal{R}$.

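Proof: Define

$$\bar{V}(\xi) \triangleq \begin{cases} V(\xi), & \xi \in \mathcal{R}, \\ 2T, & \xi \in \mathbb{R}^n - \mathcal{R}. \end{cases}$$

It is sufficient to show that $\bar{V}(\xi)$ is continuous on $\mathbb{R}^n$.

First, we show that $\bar{V}(\xi)$ is continuous at every $\xi \in \partial\mathcal{R}$.
Fix $\xi \in \partial\mathcal{R}$. For $\alpha$ that satisfies $\alpha \ge 2T$ or $\alpha < J_{\min}$, the
set $\{\xi : \bar{V}(\xi) > \alpha\}$ is empty or $\mathbb{R}^n$, respectively. For $\alpha$ with
$J_{\min} \le \alpha < 2T$, we have

$$\{\xi : \bar{V}(\xi) > \alpha\} = \mathbb{R}^n - \{\xi : \bar{V}(\xi) \le \alpha\},$$

which is open from Lemma 4 since

$$\{\xi : \bar{V}(\xi) \le \alpha\} = \{\xi : V(\xi) \le \alpha\} = \mathcal{R}_\alpha.$$

It follows that the set $\{\xi : \bar{V}(\xi) > \alpha\}$ is open for every real
number $\alpha$, and hence $\bar{V}(\xi)$ is lower semi-continuous on $\mathbb{R}^n$.
Then we have

$$\bar{V}(\xi) \le \liminf_{\eta \to \xi} \bar{V}(\eta). \tag{24}$$

On the other hand, we have

$$\limsup_{\eta \to \xi} \bar{V}(\eta) \le 2T, \tag{25}$$

from Lemma 5. Therefore,

$$\bar{V}(\xi) \le \liminf_{\eta \to \xi} \bar{V}(\eta) \le \limsup_{\eta \to \xi} \bar{V}(\eta) \le 2T = \bar{V}(\xi)$$

from (24), (25), and Lemma 5. This yields

$$\bar{V}(\xi) = \lim_{\eta \to \xi} \bar{V}(\eta),$$

which means that $\bar{V}(\xi)$ is continuous at every $\xi \in \partial\mathcal{R}$.

Since $V(\xi)$ is convex on $\mathcal{R}$ from Lemma 3 and $\mathcal{R}$ contains
the origin in its interior from the controllability of the pair
$(A, B)$ [19, Theorem 17.3, Corollary 17.1], $V(\xi)$ is continuous
at every point in $\mathrm{int}\,\mathcal{R}$ [23, Theorem 10.1, p. 44].

Therefore $\bar{V}(\xi)$ is continuous on $\mathbb{R}^n$, and then $V(\xi)$ is
continuous on $\mathcal{R}$. ■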


V. An extension to MPC

In this section, we extend the finite-horizon SOAV optimal
control discussed above to infinite-horizon model predictive
control (MPC).

Suppose that we are given a sequence $\{t_k\}_{k \in \mathbb{N}}$ of sampling
instants. We assume that

$$0 = t_0 < t_1 < t_2 < \cdots \tag{26}$$

and there exists $\tau > 0$ such that

$$\tau \le t_{k+1} - t_k \le T, \quad k = 0, 1, 2, \ldots \tag{27}$$

We also assume that the initial state $x(0) = x_0 \in \mathcal{R}$ is given.
At each sampling instant $t_k$, the SOAV optimal control, say
$u_k$, with horizon length $T$ is computed by solving the SOAV
optimal control problem (Problem 1) with $\xi = x(t_k)$. We
then apply $u_k(t)$ on the $k$-th time interval $[t_k, t_{k+1}]$. If each
optimization has the optimal solution, then this process gives
a control

$$u(t) = u_k(t - t_k), \quad t \in [t_k, t_{k+1}], \quad k = 0, 1, 2, \ldots \tag{28}$$

From the assumptions (26) and (27), the control $u(t)$ is defined
for all $t \in [0, \infty)$.
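The indexing in (28) can be made concrete with a small lookup routine. The Python sketch below is illustrative only: the names `mpc_segment` and `mpc_control` are hypothetical, each `controls[k]` is assumed to be a callable returning $u_k(s)$ for local time $s$, and at a shared endpoint $t = t_{k+1}$ the sketch arbitrarily uses the newer control $u_{k+1}$.

```python
import bisect


def mpc_segment(sampling_instants, t):
    """Return (k, t - t_k) for the interval [t_k, t_{k+1}) containing t.

    sampling_instants must be sorted and start at t_0 = 0, as in (26).
    """
    if t < sampling_instants[0]:
        raise ValueError("t precedes the first sampling instant")
    # Index of the last sampling instant that is <= t.
    k = bisect.bisect_right(sampling_instants, t) - 1
    return k, t - sampling_instants[k]


def mpc_control(controls, sampling_instants, t):
    """Evaluate u(t) = u_k(t - t_k) as in (28)."""
    k, local_t = mpc_segment(sampling_instants, t)
    return controls[k](local_t)
```

For sampling instants `[0, 4, 8, 9, 10]`, the time `t = 8.5` falls in the interval starting at $t_2 = 8$, so `mpc_segment` returns `(2, 0.5)`.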


A. Stability 

Here we investigate the stability of the closed-loop system
with the model predictive control given in (28). More precisely,
the question here is whether the origin is stable in the sense of
Lyapunov regardless of the choice of sampling instants $\{t_k\}$
with the control (28).

Note that the state $x(t)$ for $t \in [0, t_1]$ obviously stays in
the reachable set $\mathcal{R}$ while the control $u_0$ is used, since every
point outside the set $\mathcal{R}$ needs a time duration of more than $T$
to be steered to the origin by any control $v$ with $\|v\|_\infty \le 1$.
Therefore, we have $x(t_1) \in \mathcal{R}$, and the next optimal control $u_1$
exists on the next interval $[t_1, t_2]$. Then the state $x(t)$ for $t \in
[t_1, t_2]$ lies in the reachable set $\mathcal{R}$ while the control $u_1$ is used.
It follows that the state $x(t)$ lies in the reachable set $\mathcal{R}$ for
all $t \in [0, \infty)$ under this situation and each optimization has
the optimal solution, and hence the control $u$ is well defined.
Then the continuity of the value function (Theorem 6) leads
to the stability of the closed-loop system, as described in the
following theorem.

Theorem 7 (stability): If the pair $(A, B)$ is controllable and
$U_{\min} = 0$, then the origin is stable in the sense of Lyapunov
regardless of the choice of the sampling instants $\{t_k\}_{k \in \mathbb{N}}$
that satisfy (26) and (27) when we use the control $u$ defined
in (28).

Proof: Fix a sequence of sampling instants $\{t_k\}_{k \in \mathbb{N}}$ that
satisfy (26) and (27). Also fix a positive real number $\varepsilon > 0$.
We can take a real number $r \in (0, \varepsilon)$ such that

$$B_r \triangleq \{\xi \in \mathbb{R}^n : \|\xi\| \le r\} \subset \mathcal{R},$$

since $\mathcal{R}$ contains the origin in its interior from the controllability
of the pair $(A, B)$. From Theorem 6, $V(\xi)$ is continuous
on $\partial B_r$, and then we can define

$$\alpha \triangleq \min_{\xi \in \partial B_r} V(\xi).$$

From Lemma 2, we have $V(\xi) > J_{\min}$ for the initial state
$\xi \neq 0$, and hence $\alpha > J_{\min}$. Take $\beta \in (J_{\min}, \alpha)$. Then the set
$\mathcal{R}_\beta \cap \partial B_r$ is empty, and $\mathcal{R}_\beta$ contains the origin and is convex.
Hence we have $\mathcal{R}_\beta \subset \mathrm{int}\,B_r$. From the continuity of $V(\xi)$ at
the origin, there exists $\delta > 0$ such that $\|\xi\| < \delta$ implies

$$J_{\min} \le V(\xi) < \beta. \tag{29}$$

When we use the control $u$ defined in (28) for $\xi$ with $\|\xi\| < \delta$,
it is clear that we have

$$V(x_\xi(t)) \le V(\xi), \quad \forall t \ge 0, \tag{30}$$

where $x_\xi(t)$ is the state with $x_\xi(0) = \xi$ obtained by
using $u$. Therefore for $\xi$ with $\|\xi\| < \delta$ we have $V(x_\xi(t)) < \beta$
for all $t \ge 0$ from (29) and (30). Since $\mathcal{R}_\beta \subset B_r \subset B_\varepsilon$, for
any initial state $\xi$ with $\|\xi\| < \delta$ we have $x_\xi(t) \in B_\varepsilon$ for all
$t \ge 0$, which means that the origin is stable in the sense of
Lyapunov. ■

B. Numerical Optimization 

Here we propose a numerical computation algorithm to
solve the (finite-horizon) SOAV optimal control problem to
obtain a discrete-valued control input.

For simple plants, such as single or double integrators,
the discrete-valued control can be obtained in a closed form
via Pontryagin's minimum principle, as in the discussion in [13,
Chap. 8] for $L^1$ optimal control. However, for general linear
time-invariant plants, one should rely on numerical computation.
For this, we adopt a time discretization approach to
solve the SOAV control problem. This approach is standard for
numerical optimization; see e.g. [24, Sec. 2.3]. We then derive
an algorithm for the optimization based on the alternating
direction method of multipliers (ADMM). This
algorithm is simple but much faster than the standard interior
point method.

We first divide the interval $[0,T]$ into $\nu$ subintervals,
$[0,T] = [0,h) \cup \cdots \cup [(\nu-1)h, \nu h]$, where $h$ is the discretization
step chosen such that $T = \nu h$. We here assume
(or approximate) that the state $x(t)$ and the control $u(t)$ are
constant over each subinterval. On the discretization grid,
$t = 0, h, \ldots, \nu h$, the continuous-time plant (1) is described
as

$$x_d[l+1] = A_d x_d[l] + B_d u_d[l], \quad l = 0, 1, \ldots, \nu-1,$$

where $x_d[l] \triangleq x(lh)$, $u_d[l] \triangleq u(lh)$, and

$$A_d \triangleq e^{Ah}, \quad B_d \triangleq \int_0^h e^{At} B\,dt.$$

Set the control vector

$$z \triangleq [u_d[0], u_d[1], \ldots, u_d[\nu-1]]^\top.$$

Let $\xi$ be the initial state, that is, $x(0) = \xi$. Then the final state
$x(T)$ can be described as

$$x(T) = x_d[\nu] = \zeta + \Phi z,$$

where $\zeta \triangleq A_d^{\nu} \xi$ and

$$\Phi \triangleq [A_d^{\nu-1} B_d, \ A_d^{\nu-2} B_d, \ \ldots, \ B_d] \in \mathbb{R}^{n \times \nu}.$$

Rename the discrete values in $\mathbb{U}$ as

$$r_1 \triangleq -U_N, \quad r_2 \triangleq -U_{N-1}, \quad \ldots, \quad r_L \triangleq U_N,$$

where $L \triangleq 2N$, and the weights for $J(u)$ in (4) as

$$p_1 = p_L \triangleq w_N, \quad p_2 = p_{L-1} \triangleq w_{N-1}, \quad \ldots, \quad p_N = p_{N+1} \triangleq w_1.$$
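The discretization quantities $A_d$, $B_d$, $\Phi$, and $\zeta$ can be assembled numerically as in the following Python sketch. It is a minimal illustration under stated assumptions: the function names are hypothetical, the matrix exponential is computed by a simple truncated Taylor series with scaling and squaring (a library routine such as `scipy.linalg.expm` would normally be preferred), and $A_d$ and $B_d$ are obtained together from the exponential of the augmented matrix $\begin{bmatrix} A & B \\ 0 & 0 \end{bmatrix} h$, a standard identity that is not taken from the paper.

```python
import numpy as np


def expm_taylor(M, terms=30):
    """Matrix exponential via scaling and squaring with a truncated Taylor series."""
    M = np.asarray(M, dtype=float)
    norm = np.linalg.norm(M, 1)
    s = max(0, int(np.ceil(np.log2(norm))) + 1) if norm > 0 else 0
    Ms = M / (2.0 ** s)
    E = np.eye(M.shape[0])
    term = np.eye(M.shape[0])
    for k in range(1, terms):
        term = term @ Ms / k
        E = E + term
    for _ in range(s):
        E = E @ E
    return E


def discretize(A, B, xi, T, nu):
    """Return (A_d, B_d, Phi, zeta) for step h = T / nu."""
    A = np.atleast_2d(np.asarray(A, dtype=float))
    n = A.shape[0]
    B = np.asarray(B, dtype=float).reshape(n, 1)
    h = T / nu
    # exp([[A, B], [0, 0]] h) = [[A_d, B_d], [0, 1]]
    aug = np.zeros((n + 1, n + 1))
    aug[:n, :n] = A
    aug[:n, n:] = B
    E = expm_taylor(aug * h)
    Ad, Bd = E[:n, :n], E[:n, n:]
    # Phi = [A_d^{nu-1} B_d, ..., A_d B_d, B_d]
    Phi = np.hstack([np.linalg.matrix_power(Ad, nu - 1 - l) @ Bd for l in range(nu)])
    zeta = np.linalg.matrix_power(Ad, nu) @ np.asarray(xi, dtype=float)
    return Ad, Bd, Phi, zeta
```

For a double integrator with $h = 0.25$, this reproduces the familiar values $A_d = \begin{bmatrix} 1 & h \\ 0 & 1 \end{bmatrix}$ and $B_d = [h^2/2, \ h]^\top$.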


1) Alternating direction method of multipliers (ADMM): 
We here briefly review the ADMM algorithm. The ADMM 
solves the following type of convex optimization. 

minimize f(^z) + giy) 

zGR'^1 ,2/GR'^2 ( 2 ) 2 ) 

subject to y = 

where / : i—;■ RU{cx)} and g : i—;■ RU{oo} are proper 

lower semi-continuous convex functions, and Jp € R^^^^L 
The algorithm of ADMM is given, for € R^^ and 

7 > 0, by 

+ 1] ^ arg min{/(2) -f ^\\y[j] --^z- w[j]\\'^} 
zGR^i 

y[j + 1] ^ pmx^g{'iiz[j -b 1] -b w[j]) 
w[j + 1] ^ w[j] + T'z[j + 1] - y[j + 1] 

(33) 

for j = 0,1,2 ,..., where prox..|,g denotes the proximity 
operator of yg defined by 

prox.^ (2) = arg min75(7/) -b - 7/|p. 

yGR"^ 2 

We recall a convergence analysis of ADMM by Eckstein- 
Bertsekas insi. 

Theorem 8 (Convergence of ADMM ^51/ ).' Consider the 
optimization problem (l32l i. Assume that is invertible 

and that a saddle point of its unaugmented Lagrangian 
Co{z,y,w) = f{z) + g{y) - (T'z - y)^w exists. Then 
the sequence {(-2[j],y[j])}jGN generated by Algorithm 
converges to a solution of (l32l) . 

2) Reformulation into ADMM-applicable form: In what 
follows, we reformulate our optimization problem described 
in (ImT) into the standard form in to apply ADMM. 

Let Hi = {z G R"^! 11^11^=0 < 1} be the unit-ball of the 
infinity norm, and H2 = {—C} be the singleton consisting of 
the vector —C,. Define the indicator function of a nonempty 
closed convex set by 

^ Jo, if z e H, 

^ Joo, otherwise. 

Then, we can rewrite the optimization problem dSTI) as 


Then the SOAV optimal control problem is approximated by 

L 

minimize 7 pdlz —rdLi 

zGR" ^' 


i=l 


(31) 


subject to ||z||f=° <1, $ 2 ; -b C = 0, 


L 

minimize Pi\\z - TiW^i + lq,i{z) + (34) 

2=1 

Introducing new variables j/i,..., yL +2 such that yi = z (i = 
1,..., L + 1), and yL +2 = ^z, we can translate d34l i into 


where || • H^i and || • H^oo are the and £°° norms in 
R*^, respectively. The optimization problem (IMT i is reducible 
to linear programming HD, and can be solved by standard 
numerical software packages, such as cvx with MATLAB 
El, El, based on the interior point method. However, for 
large scale problems, the computational burden of such an 
algorithm becomes heavy, and hence we give a more efficient 
algorithm based on ADMM. 


minimize - ri\\e^ + Ln^{yL+i) + Ln^{yL+ 2 ) 

zGM"i ,yGR"2 ^ 

subject to y = "^z 


( 35 ) 


where Ni = v, N 2 = {L+l)v+n, y = [yf ... yl_^_ 2 ^ £ 

and 

1 T _ .nT, N'^ X A^I 


T- = [/ ... / G 
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Finally, by setting 

f(z) = 0, 

L 

giy) = '^Pihi - + ini( 2 /L+i) + inAyL+2) 

i=l 

the optimization problem (IlSl l is reduced to the standard from 

of®. 

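3) Computation: Since $f = 0$, the first step of (33) becomes a
strictly convex quadratic minimization, which boils down to
solving linear equations, that is,

$$z[j+1] = \underset{z \in \mathbb{R}^{\nu}}{\arg\min} \ \frac{1}{2\gamma} \|y[j] - \Psi z - w[j]\|_2^2 = \big((L+1)I + \Phi^\top \Phi\big)^{-1} \Psi^\top (y[j] - w[j]),$$

where

$$\Psi^\top (y[j] - w[j]) = \sum_{i=1}^{L+1} (y_i[j] - w_i[j]) + \Phi^\top (y_{L+2}[j] - w_{L+2}[j]).$$

Note that the inverse matrix $((L+1)I + \Phi^\top \Phi)^{-1}$ can be
computed off-line.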
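The z-update above can be sketched in a few lines of NumPy. This is only an illustrative sketch: the function name `make_z_update` and the representation of $y$ and $w$ as lists of $L+2$ blocks are assumptions made here, not part of the paper. The matrix $(L+1)I + \Phi^\top\Phi$ is inverted once, mirroring the off-line computation mentioned in the text.

```python
import numpy as np

def make_z_update(Phi, L):
    """Return a function computing the z-update for Psi = [I ... I Phi^T]^T.

    Hypothetical helper: Phi has shape (n, nu); y and w are lists of L + 2
    blocks, the first L + 1 of length nu and the last one of length n.
    """
    n, nu = Phi.shape
    # (L + 1) I + Phi^T Phi is symmetric positive definite; invert it once.
    M_inv = np.linalg.inv((L + 1) * np.eye(nu) + Phi.T @ Phi)

    def z_update(y_blocks, w_blocks):
        # Psi^T (y - w) = sum_{i <= L+1} (y_i - w_i) + Phi^T (y_{L+2} - w_{L+2})
        rhs = sum(y - w for y, w in zip(y_blocks[:L + 1], w_blocks[:L + 1]))
        rhs = rhs + Phi.T @ (y_blocks[L + 1] - w_blocks[L + 1])
        return M_inv @ rhs

    return z_update
```

For real-time use one would typically store a Cholesky factor instead of the explicit inverse; the explicit inverse is kept here only to stay close to the wording of the text.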

On the other hand, the second step of (33) can be separated
with respect to each $y_i$. For $y_i$ ($i = 1, \dots, L$), we have to
compute the proximity operator of the $\ell^1$ norm with shift $r_i$,
which is reduced to a simple soft-thresholding operation.

Fig. 1. Discrete-valued control by SOAV MPC.


VI. Example 


In this section, we give an example of model predictive 
control based on the SOAV optimal control. Let us consider 
the plant model represented in 


x{t) 


0 

-2 


1 

-1 


c(f) 


t(f), 


t > 0. 


For this system, we consider the following SOAV optimal 
control problem: 


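The proximity operator of the shifted, weighted $\ell^1$ norm is computed entrywise as

$$\big[\mathrm{prox}_{\gamma p_i \|\cdot - r_i\|_{\ell^1}}(z)\big]_{(l)} = r_{i,(l)} + \mathrm{prox}_{\gamma p_i |\cdot|}\big(z_{(l)} - r_{i,(l)}\big) = r_{i,(l)} + \mathrm{sgn}\big(z_{(l)} - r_{i,(l)}\big) \max\{|z_{(l)} - r_{i,(l)}| - \gamma p_i, 0\},$$

where $(\cdot)_{(l)}$ denotes the $l$-th entry of a vector. Here we use
the shift property of the proximity operator (see, e.g., [?]).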
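As a small illustration of the soft-thresholding formula, the following NumPy sketch evaluates the proximity operator of $\tau\|\cdot - r\|_{\ell^1}$ entrywise, with $\tau$ playing the role of $\gamma p_i$. The function name `prox_shifted_l1` is hypothetical.

```python
import numpy as np

def prox_shifted_l1(z, r, tau):
    """Proximity operator of tau * ||. - r||_1 at z (entrywise soft-thresholding).

    Hypothetical helper: z and r are arrays of equal length, tau >= 0.
    """
    d = np.asarray(z, dtype=float) - np.asarray(r, dtype=float)
    # Shift by r, shrink the magnitude by tau, and shift back.
    return r + np.sign(d) * np.maximum(np.abs(d) - tau, 0.0)
```

Entries of `z` within `tau` of the corresponding entry of `r` are mapped exactly onto `r`, which is the mechanism that pulls the control samples toward the prescribed discrete values.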

For $y_{L+1}$ and $y_{L+2}$, the computation of the proximity
operators of the indicator functions is required. Since the
proximity operator of the indicator function of a nonempty
closed convex set $\Omega$ equals the metric projection $P_{\Omega}$ onto
$\Omega$, the updates of $y_{L+1}$ and $y_{L+2}$ are reduced to calculating
$P_{\Omega_1}$ and $P_{\Omega_2}$, respectively. We can compute $P_{\Omega_1}$ as follows:


$$\begin{aligned} &\text{minimize} && J(u) = \sum_{i=1}^{4} w_i \phi_i(u) \\ &\text{subject to} && x(0) = [5, 5]^\top, \quad x(5) = 0, \quad \|u\|_\infty \le 1, \end{aligned}$$

where $w_i = 0.1i$ ($i = 1, 2, 3, 4$), and $U_1 = U_{\min} = 0$, $U_2 =
0.2$, $U_3 = 0.6$ and $U_4 = 1$. The sampling instants are taken as
$t_1 = 4$, $t_2 = 8$, $t_3 = 9$ and $t_4 = 10$. Fig. 1 shows the control $u$
defined by (28) and Fig. 2 shows the state trajectory according
to $u$. Certainly, we can see that the control $u$ takes only discrete
values $0$, $\pm 0.2$, $-0.6$ and $-1$, and the state converges to the
origin.


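$$P_{\Omega_1}(z) = \begin{cases} z, & \text{if } \|z\|_{\ell^\infty} \le 1, \\ \tilde{z}, & \text{otherwise,} \end{cases}$$

where

$$\tilde{z} = \big[\mathrm{sgn}(z_{(1)}) \min\{|z_{(1)}|, 1\} \ \dots \ \mathrm{sgn}(z_{(\nu)}) \min\{|z_{(\nu)}|, 1\}\big]^\top.$$

Meanwhile, $P_{\Omega_2} = P_{\{-\zeta\}}$ is simply given by

$$P_{\Omega_2}(z) = -\zeta.$$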
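Both projections are inexpensive to evaluate. A minimal NumPy sketch follows, with hypothetical function names; the entrywise clipping coincides with the $\mathrm{sgn}(\cdot)\min\{|\cdot|, 1\}$ formula above.

```python
import numpy as np

def project_linf_ball(z):
    """Metric projection onto the unit ball of the l-infinity norm."""
    # Clipping each entry to [-1, 1] equals sgn(z) * min(|z|, 1) entrywise.
    return np.clip(np.asarray(z, dtype=float), -1.0, 1.0)

def project_singleton(z, zeta):
    """Metric projection onto the singleton {-zeta}; the result ignores z."""
    return -np.asarray(zeta, dtype=float)
```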

As addressed in [?], ADMM tends to converge to modest
accuracy within a few tens of iterations. This property is
favorable in real-time control systems.
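Putting the three steps together gives a compact iteration. The sketch below is not the authors' implementation: it assumes the standard scaled-form ADMM updates (z-update, proximal y-update, dual update $w \leftarrow w + \Psi z - y$), and the function name `soav_admm`, the zero initialization, and the default values of `gamma` and `iters` are choices made here for illustration.

```python
import numpy as np

def soav_admm(Phi, zeta, p, r, gamma=1.0, iters=500):
    """Sketch of ADMM for: min sum_i p_i ||z - r_i||_1
    subject to ||z||_inf <= 1 and Phi z + zeta = 0.

    Phi: (n, nu) array; zeta: (n,) array; p: list of L weights;
    r: list of L arrays of length nu. All names are illustrative.
    """
    n, nu = Phi.shape
    L = len(p)
    M_inv = np.linalg.inv((L + 1) * np.eye(nu) + Phi.T @ Phi)  # off-line
    y = [np.zeros(nu) for _ in range(L + 1)] + [np.zeros(n)]
    w = [np.zeros(nu) for _ in range(L + 1)] + [np.zeros(n)]
    z = np.zeros(nu)
    for _ in range(iters):
        # Step 1: least-squares z-update.
        rhs = sum(yi - wi for yi, wi in zip(y[:L + 1], w[:L + 1]))
        rhs = rhs + Phi.T @ (y[L + 1] - w[L + 1])
        z = M_inv @ rhs
        Psi_z = [z] * (L + 1) + [Phi @ z]
        # Step 2: blockwise y-update (soft-thresholding and projections).
        for i in range(L):
            d = Psi_z[i] + w[i] - r[i]
            y[i] = r[i] + np.sign(d) * np.maximum(np.abs(d) - gamma * p[i], 0.0)
        y[L] = np.clip(Psi_z[L] + w[L], -1.0, 1.0)
        y[L + 1] = -np.asarray(zeta, dtype=float)
        # Step 3: scaled dual update.
        w = [wi + si - yi for wi, si, yi in zip(w, Psi_z, y)]
    return z
```

Each iteration costs one matrix-vector product with the precomputed inverse plus entrywise operations, which is consistent with the claim that the method suits real-time computation.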


VII. Conclusion 

In this paper, we have proposed sum-of-absolute-values 
(SOAV) optimization for discrete-valued control. We have 
shown the existence and uniqueness theorems of the SOAV 
optimal control. We have also given conditions for the SOAV 
optimal control to generate a discrete-valued control signal. 
The obtained discrete-valued control has a finite number
of switchings, for which an upper bound has been derived.
Furthermore, we have investigated the continuity of the value
function, by which stability has been proved when the
(finite-horizon) SOAV optimal control is extended to model
predictive control (MPC). For MPC, a fast algorithm based on
ADMM has been proposed. A simulation result has been presented
to show the effectiveness of the proposed method.
Fig. 2. State $x(t) = [x_1(t), x_2(t)]^\top$.


Appendix A
Proof of Lemma 1

First, we show that the value of the cost function $J(u)$ for
each $u \in \{u \in L^\infty : \|u\|_\infty \le 1\}$ is greater than or equal to
$J_{\min}$, and then we show the minimum $J_{\min}$ is achieved by
$u = 0$.

Fix a control $u$ with $\|u\|_\infty \le U_N = 1$, and define

$$\begin{aligned} E &\triangleq \{t \in [0, T] : -U_1 \le u(t) \le U_1\}, \\ E_k^+ &\triangleq \{t \in [0, T] : U_k < u(t) \le U_{k+1}\}, \\ E_k^- &\triangleq \{t \in [0, T] : -U_{k+1} \le u(t) < -U_k\}, \end{aligned} \tag{36}$$

where $k = 1, 2, \dots, N-1$. Let $\gamma = m(E)$ and $\gamma_k^\pm = m(E_k^\pm)$.
Since these sets are pairwise disjoint and satisfy

$$E \cup \bigcup_{k=1}^{N-1} (E_k^+ \cup E_k^-) = [0, T],$$

we have

$$\gamma + \sum_{k=1}^{N-1} (\gamma_k^+ + \gamma_k^-) = T \tag{37}$$

from the countable additivity of the Lebesgue measure. Let

$$\lambda_k^\pm \triangleq \pm \int_{E_k^\pm} u(t)\,dt.$$

An elementary computation yields

$$\phi_i(u) = \int_0^T \big(|u(t) - U_i| + |u(t) + U_i|\big)\,dt = 2U_i \Big(\gamma + \sum_{k=1}^{i-1} (\gamma_k^+ + \gamma_k^-)\Big) + 2 \sum_{k=i}^{N-1} (\lambda_k^+ + \lambda_k^-)$$

for $i = 1, 2, \dots, N$, where we define $\sum_{k=1}^{0} = 0$ and $\sum_{k=N}^{N-1} = 0$.
Then for $k = i, i+1, \dots, N-1$, we have

$$\lambda_k^\pm = \pm \int_{E_k^\pm} u(t)\,dt \ge U_k \gamma_k^\pm \ge U_i \gamma_k^\pm.$$

It follows from (37) that

$$\phi_i(u) \ge 2U_i \Big(\gamma + \sum_{k=1}^{N-1} (\gamma_k^+ + \gamma_k^-)\Big) = 2U_i T,$$

and hence

$$J(u) \ge 2T \sum_{i=1}^{N} w_i U_i = J_{\min}.$$

Therefore the cost function $J(u)$ takes values greater than or
equal to $J_{\min}$, and $J(u)$ attains the minimum $J_{\min}$ when $u = 0$.


Appendix B
Proof of Lemma 2

Fix an initial state

$$\xi \in \Big\{ \int_0^T e^{-At} B u(t)\,dt : \|u\|_\infty \le U_{\min} \Big\}.$$

Then there exists a control $u$ satisfying

$$\xi = \int_0^T e^{-At} B u(t)\,dt, \quad \|u\|_\infty \le U_{\min}.$$

Since $-u$ is feasible for $\xi$ and $-u(t) \in [-U_{\min}, U_{\min}]$ for
almost all $t \in [0, T]$, we have

$$J(-u) = 2T \sum_{i=1}^{N} w_i U_i = J_{\min}.$$

It follows that $-u$ is an optimal control, and hence $\xi \in \mathcal{R}_{\min}$.
Conversely, take an initial state $\xi \in \mathcal{R}_{\min}$ and let $u^*$ denote
the optimal control that satisfies $J(u^*) = J_{\min}$. Define sets
$E_k^+$, $E_k^-$ and $E$ as in the proof of Lemma 1. Then we can
easily show that

$$\int_{E_k^\pm} \big(\pm u^*(t) - U_k\big)\,dt = 0$$

for every $k = 1, 2, \dots, N-1$. Since $u^*(t) - U_k$ and $-u^*(t) - U_k$
are positive on $E_k^+$ and $E_k^-$ for every $k$, respectively, we
have $m(E_k^\pm) = 0$ for every $k$. Therefore $m(E) = T$ from
(37), that is, $\|-u^*\|_\infty \le U_1 = U_{\min}$. Also, since the control
$u^*$ steers the initial state $\xi$ to the origin at time $T$, we have

$$\xi = \int_0^T e^{-At} B \big(-u^*(t)\big)\,dt,$$

and it follows that

$$\xi \in \Big\{ \int_0^T e^{-At} B u(t)\,dt : \|u\|_\infty \le U_{\min} \Big\}.$$
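The lower bound of Lemma 1 can be checked numerically for piecewise-constant controls. The sketch below uses the weights $w_i$ and levels $U_i$ of the example in Section VI; the horizon `T`, the grid size `K`, and the function name `soav_cost` are hypothetical choices made only for this illustration.

```python
import numpy as np

def soav_cost(u, dt, w, U):
    """J(u) = sum_i w_i * int_0^T (|u - U_i| + |u + U_i|) dt
    for a control u that is constant on each grid cell of width dt."""
    u = np.asarray(u, dtype=float)
    phi = [np.sum(np.abs(u - Ui) + np.abs(u + Ui)) * dt for Ui in U]
    return float(np.dot(w, phi))

w = np.array([0.1, 0.2, 0.3, 0.4])   # weights from the example
U = np.array([0.0, 0.2, 0.6, 1.0])   # levels from the example
T, K = 1.0, 1000                      # hypothetical horizon and grid size
dt = T / K
J_min = 2.0 * T * float(np.dot(w, U))  # 2 T sum_i w_i U_i

rng = np.random.default_rng(0)
u_random = rng.uniform(-1.0, 1.0, size=K)   # arbitrary admissible control
u_bang = np.where(np.arange(K) % 2 == 0, 1.0, -1.0)  # takes only +1 and -1
```

With these definitions, `soav_cost(np.zeros(K), dt, w, U)` reproduces `J_min`, any admissible `u_random` gives a cost no smaller than `J_min`, and a control taking only $\pm 1$ gives $2T\sum_i w_i = 2T$, the value that appears in Appendix E.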


Appendix C
Proof of Lemma 3

Fix initial states $\xi, \eta \in \mathcal{R}$ and a scalar $\lambda \in (0, 1)$. From
Theorem 1 there exist optimal controls $u_\xi$ and $u_\eta$ for the
initial states $\xi$ and $\eta$, respectively. Then we have $\lambda\xi + (1 - \lambda)\eta \in \mathcal{R}$
since $\mathcal{R}$ is convex, and the control $\lambda u_\xi + (1 - \lambda) u_\eta$ is
feasible for the initial state $\lambda\xi + (1 - \lambda)\eta$. From the convexity
of $\phi_i$ in $J(u)$ (see (??)), we have

$$\begin{aligned} V(\lambda\xi + (1 - \lambda)\eta) &\le J(\lambda u_\xi + (1 - \lambda) u_\eta) \\ &= \sum_{i=1}^{N} w_i \phi_i(\lambda u_\xi + (1 - \lambda) u_\eta) \\ &\le \sum_{i=1}^{N} w_i \big(\lambda \phi_i(u_\xi) + (1 - \lambda) \phi_i(u_\eta)\big) \\ &= \lambda J(u_\xi) + (1 - \lambda) J(u_\eta) \\ &= \lambda V(\xi) + (1 - \lambda) V(\eta). \end{aligned}$$


Appendix D
Proof of Lemma 4

First, we note that the set $\mathcal{R}_\alpha$ is well defined for $\alpha \ge J_{\min}$
since $J_{\min}$ is the minimum of the cost function from Lemma 1.
Fix $\alpha \ge J_{\min}$, and take a sequence $\{\xi_l\}$ in $\mathcal{R}_\alpha$ that
converges to $\xi_\infty \in \mathbb{R}^n$. It is sufficient to show that $\xi_\infty \in \mathcal{R}_\alpha$.
For each $\xi_l \in \mathcal{R}_\alpha$, there exists a control $u_l$ such that

$$\xi_l = \int_0^T e^{-At} B u_l(t)\,dt, \quad \|u_l\|_\infty \le 1, \quad J(u_l) \le \alpha.$$

Since the set $\{u \in L^\infty : \|u\|_\infty \le 1\}$ is sequentially compact
in the weak* topology of $L^\infty$, there exist a measurable
function $u_\infty$ with $\|u_\infty\|_\infty \le 1$, and a subsequence $\{u_{l'}\}$ such
that $\{u_{l'}\}$ converges to $u_\infty$ in the weak* topology of $L^\infty$.
Clearly, we have

$$\xi_\infty = \int_0^T e^{-At} B u_\infty(t)\,dt.$$

Define $J_{l'}^{\pm}$ as in (??) and $J_{l'} = J_{l'}^{+} + J_{l'}^{-}$. Then we have

$$J(u_\infty) = \lim_{l' \to \infty} J_{l'} \le \lim_{l' \to \infty} J(u_{l'}) \le \alpha,$$

which is verified from (9) and (10). It follows that $\xi_\infty \in \mathcal{R}_\alpha$.


Appendix E
Proof of Lemma 5

First, we show

$$\partial\mathcal{R} = \{\xi : V(\xi) = 2T\}. \tag{38}$$

Fix $\xi \in \partial\mathcal{R}$, then the feasible control for the initial state $\xi$ is
only the time optimal control, which is determined uniquely
and takes only $\pm 1$ for almost all $t \in [0, T]$ since the pair
$(A, B)$ is controllable [28], [?, Theorem 12.1]. Let us denote
the time optimal control by $u^*$, and let $F^+, F^- \subset [0, T]$ be
the sets on which $u^*$ takes $1$ and $-1$, respectively, that is,

$$u^*(t) = \begin{cases} 1, & \text{if } t \in F^+, \\ -1, & \text{if } t \in F^-, \end{cases}$$

and $m(F^+) + m(F^-) = T$. Then we have

$$V(\xi) = J(u^*) = 2 \sum_{i=1}^{N} w_i \big(m(F^+) + m(F^-)\big) = 2T.$$

Conversely, fix an initial state $\xi \in \mathcal{R}$ such that $V(\xi) = 2T$.
If $\xi \in \operatorname{int}\mathcal{R}$, then there exist a scalar $\lambda \in [0, 1)$ and a vector
$\eta \in \partial\mathcal{R}$ such that $\xi = \lambda\eta$. As we proved above, we have
$V(\eta) = 2T$. It follows from the convexity of $V$ that

$$V(\xi) = V(\lambda\eta) \le \lambda V(\eta) + (1 - \lambda) V(0) = 2\lambda T + (1 - \lambda) V(0),$$

which yields

$$2T \le V(0) \tag{39}$$

since $V(\xi) = 2T$.

However, since $u = 0$ is feasible for the initial state $0$, we
have

$$2T \sum_{i=1}^{N} w_i U_i \le V(0) \le J(0) = 2T \sum_{i=1}^{N} w_i U_i$$

from Lemma 1. This implies

$$V(0) = 2T \sum_{i=1}^{N} w_i U_i < 2T \sum_{i=1}^{N} w_i = 2T. \tag{40}$$

Thus a contradiction occurs between (39) and (40), and
hence $\xi \notin \operatorname{int}\mathcal{R}$. Since $\mathcal{R}$ is closed [?], we have $\xi \in \partial\mathcal{R}$.

Next, we show

$$\mathcal{R} = \{\xi : V(\xi) \le 2T\}.$$

From (38), it is sufficient to show

$$\operatorname{int}\mathcal{R} = \{\xi : V(\xi) < 2T\}. \tag{41}$$

First, fix an initial state $\xi \in \operatorname{int}\mathcal{R}$, then there exist a scalar
$\lambda \in [0, 1)$ and a vector $\eta \in \partial\mathcal{R}$ such that $\xi = \lambda\eta$, and $V(\eta) =
2T$ from (38). It follows from (40) that

$$V(\xi) \le \lambda V(\eta) + (1 - \lambda) V(0) = 2\lambda T + (1 - \lambda) V(0) < 2T.$$

Conversely, for any initial state $\xi$ such that $V(\xi) < 2T$, we
have $\xi \in \operatorname{int}\mathcal{R}$ from (38). Thus (41) follows, and the proof is
completed.